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Abstract 

We present a short proof of Szemeredi's Theorem using a dynamical system 
enriched by ideas from model theory. The resulting proof contains features remi- 
niscent of proofs based on both ergodic theory and on hypergraph regularity. 

1 Introduction 

Szemeredi's Theorem states: 

Theorem 1. For any 5 > and any k, there is an n such that whenever N > n, 
^ ^ [li N], and \ A\ > SN, there exists an a and a d such that 

a, a + d, a + 2d, . . . , a + {k - l)d e A. 

Szemeredi's original proof lfT4l used graph theoretic methods, in particular the Sze- 
meredi Regularity Lemma [ 15 |. Shortly after, Furstenberg gave a different proof |I5] ID, 
based on a correspondence argument which translates the problem into one in ergodic 
theory. Beginning with a new proof given by Gowers fT], a number of new proofs 
have been developed in the last decade. (Tao counts a total of roughly sixteen different 
proofs I.19J.) 

Hrushovski has recently used a stronger correspondence-type argument |l9] to make 
progress on a similar combinatorial problem (the so-called non-commutative Freiman 
conjecture). In this paper, we use Hrushovski's method to give a short proof of Sze- 
meredi's theorem. 

The proof here bears a similarity to proofs based on hypergraph regularity, such as 
im [12] [T3] [16]; in particular the proof is very similar to the infinitary regularity-like 
arguments introduced by Tao ifTTl and used by Austin to prove both Szemeredi's The- 
orem (T\ and generalizations |T| . Indeed, this proof was inspired by noticing that the 
use of "wide types" (countable intersections of definable sets of positive measure) in 
Hrushovski's arguments was analogous to the use of the regularity lemma in finitary 
arguments. (In fact, Hrushovski essentially sketches a proof of the k — 3 case of Sze- 
meredi's Theorem in [9J; however his arguments depend on stability theoretic methods 
which don't seem to generalize to higher k. This seems related to the fact that stability 
implies 3-amalgamation, but not 4-amalgamation.) 
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The methods here are also reminiscent of those used by Tao to prove the "diagonal 
ergodic theorem" 1 18 1, and especially to our infinitary reformulation of that proof {20^. 
This paper might shed light on the connection between that method and the technique 
of "pleasant extensions" used by Austin |3 1. 

We are grateful to the members of UCLA's reading seminar on |9|: Matthias As- 
chenbrenner, Isaac Goldbring, Greg Hjorth, Terence Tao, and Anush Tserunyan. 



2 A Correspondence Principal 

Suppose that Szemeredi's Theorem fails; that is, for some S > and every n, there is an 
An C [l,n] with \ An\ > Sn such that An contains no /c-term arithmetic progression. 
We must first make a technical adjustment: we view the sets An as subsets of the group 
[1 , 2n + 1] . Then | An \ > Sn/2~e where e as n oo, and An contains no fc-term 
arithmetic progressions in this groupQ 

We extend the language of groups with a predicate symbol A and the following 
additional class of formulas: 

• Whenever ai, . . . , a^, 7 is a sequence of rationals, x a tuple, and Bi{x, yi), . . . , Bk{x, yk) 
a sequence of formulas, / J2i<k ' Bidm{x) > 7 is a formula with free vari- 
ables yi 



We let ([1, 2n + 1], A,i) be models, interpreting the symbol A by An and 
l^cti- (j)^dm{x) > 7 



i<k 



i<k 



We write as an abbreviation for / 1 • (\)dm{x), and sometimes omit x when it is 

clear from context. 

Form an ultraproduct of those groups [1, 2n + 1] such that 2n + 1 is prime. We 
obtain a model (G, + , A). By transfer, the formula 

^a,d(a ^ A h a ^ d ^ A h ■ ■ ■ a + d ^ V d ^ A) 

is false. 

Observe that for any countable set M and any n, the sets of n-tuples definable 
with parameters from M form an algebra of internal sets, and using the Loeb measure 
construction, we may extend the internal counting measure on this model to a measure 
/i" on the (T-algebra of Borel sets generated from these definable sets (for basic facts 
about this construction, see Q). The measures /i" satisfy Fubini's Theorem irolfTTI : 
that is, 

^ = [[ fdfi^^df,"' 



'without this modification. An might contain no arithmetic progressions as a set of integers, but contain 
arithmetic progressions as a subset of the group, since the group contains progressions which "wrap around": 
for instance, 3, 10, 4 is an arithmetic progression in [1, 13], but does not correspond to an arithmetic pro- 
gression in the integers. Expanding the group by adding a "dead zone" disjoint from A„ is a standard way 
of avoiding this problem. 
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where no +ni = n. 

Lemma 2. If(G,+,A) N to((^) > j then ij.\^\{{x \ (f){x)}) > 7. 
//(G,+, A) N ^m(0) > jthen ^i\^\{{x\ (l){x)}) < 7. 

Proof. For the first part, since (G, +, A) N m{(p) > 7, for almost every n, ([1, 2n + 

1], +, A2n+i) N m{(j)) > 7, and therefore 

\{x\m}\ ^ ^, 

(271+1)15^1 T 

holds in ([1, 2n + 1], ^2n+i). (Here </> may contain parameters.) 
But since this holds for almost every n, by transfer 

\{x\m}\ ^ 

(2n+ 1)1^1 ^ 

holds in (G, +, A) where the inequality is between nonstandard rational numbers. This 
means that 

For the second part, suppose (G, +, A) \= ^m{<t)) > 7. Then for almost every n, 
([1, 2n + 1], +, A2n+i) 1= -''m{(j)) > 7, and therefore 

(2n+ 1)1^1 

holds in ([1, 2n + 1], ^2n+i). But since this holds for almost every n, by transfer, 

(2n+ 1)1^1 

holds in (G, +, A), so also 

□ 

Since both m and /U are additive, this extends innmediately to the corresponding 
integrals. 

As a notational convenience, let us write x for a sequence xi, . . . ,Xk, and xj for the 
sequence xi, . . . , Xi^i, . . . ,Xk and a?^ for the sequence xi, . . . , Xi-i,Xi+i, . . . , Xk-i- 

Observe that for any 6, the function a a+b is a definable bijection, as is a fc-a 
for any integer k. For r < A:, we define formulas Ar on xr- When r < k,we define 

Ar{xr) ■.<^ ^ i-Xi+r-\xk- ^ a;j e A. 
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We define 

i<k 

Note that for any x ^_-^ j^ , | e ^fc}) = m(^)> so by the Fubini property of 

these measures, = n{A). 

Define := X^j</s Xi; it is easy to see that for any aj-^, the function Oj i-^ 

<j{a-j^) is a bijection. Whenever Ak{ia,-^), also Ai(a^, cr(a^)) for each i < k. In 

particular, if we define := {a^ | Afc(a^)AVi < A;Ai(aj-^, cr(a-jf))}, wehave A = A/j, 
and so ^*=-i(i) = fi(A) > 0. 

In the next section, we will show that, under these conditions, l^'' (CliKk ^i) > 
0. First, however, we show that this is enough to prove Szemeredi's Theorem. If 
l^'' ir\i<k ^i) > 0' a G nj<fe^* su'^h that aj. 7^ I]j<fe a^. Setting 

a := i • fli and d := Uk — J2i<k ^i' ^® a + id S A for i G [0, fc — 1]. 

Therefore a, o + rf, . . . , a + (fc — l)d is a fc-term arithmetic progression in A. 

This contradicts the construction of the model, which in turn means that the initial 
assumption that the sets An C [1,N] exist must fail. Therefore for every 6, there is 
an N such that for every A^ C [1, N] with \An\ > 6n, An contains an arithmetic 
progression of length fc. 

3 Amalgamation 

Definition 3. Let M C G be a set, let n be a positive integer, and let / C [1, n] foe 
given. We define Bn,i{M) to be the Boolean algebra of subsets ofG" of the form 

{{xi,...,Xn) I <f>i{Xi}iel)} 

where is a formula with parameters from M. 

We write ('^) for the collection of subsets of S with cardinality fc. When k <n,we 
define Bn,k{.M) = V/e([i."i) Bn,i{,M). 

We also define Bn,<i{M) = 1) 

IfB is any Boolean algebra, we write B'^ for the a -algebra generated by B. 

We equate formulas with the sets they define, so we will also speak of B as being a 
Boolean algebra of formulas. 

Lemma 4. Let B, Bo, Bi be Boolean algebras with B C Bq fl Bi. Suppose there is an 
elementary submodel M such that: 

• Every parameter in every formula in B\ belongs to M, 

• If(l){x, a) G Bo,bG M, and \a\ = \b\, then (j){x, b) G B. 

Thenforany f G L^B^ I - E{f \ B^)\\ = 0. 

An illustrating case is when B = Bn,k{M), Bq = Bn,k{M U N), and Bi = 
B„,fc+i(M). 
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Proof. Suppose not. Then setting e := ||/ - E{f \ B'')\\l2 and 5 := \\f - E{f \ 
Vq)\\i^2, we must have 6 < e. For some . . . , and Ai{x, bi), . . . , Am{x, bm) & 
Bi with each h S M, we have ||/ - ^.<„ Axa. {x, 5^)11^2 < (e - (5)/4. 

Since \ \f — E[f \ Bq)\ \ =5, there are ai, . . . , a„ and Di{x,di), . . . , Dn{x, a„) G 
Vq with 11/ - Ej<„Q:iXDi(x,6i)||z,2 < e - (e - (5)/2, and therefore 

II ^PiXAi{x,bi) - ^aiXDi{x,hi)\W < e - 3(e - ^)/4. 

i<m i<n 

This means the formula 

3yi, . . . , fm-'d I ^ AXAi (:r, bi) - ^ aiX^i Vi) I li^ > [e - 3(e - S)/4f) 

is satisfied (where, to view this as a formula, we expand the norm into an integral of 
sums of definable formulas). 

This is a formula with parameters from M, so by the elementarity of M, there are 
witnesses a'^, . . . , in M satisfying: 

1 1 J2 PiXA, {x, bi) - ^ aiXD, {x, 3fi)\\L2<e- 3(e - S)/A. 

i<m i<n 

It follows that 11/ - Y.i<n'^iXDi{x,a'i)\\L2 < e, and since ^iXCi ^i) 

is measurable with respect to V, this contradicts the assumption that ||/ — E{f \ 
2?")||L^=e. □ 

Theorem 5. Let n < k and suppose that for each each I C [l,n] with \I\ = k, 
we have a set Aj e B^ j(-^)> suppose there is a 6 > such that whenever 
Bi e Bn i{M) and fi'^iAi \ Bj) < 5 for all I G (I^f 1), f]Bi is non-empty. Then 
/^"(ae(U,"i) Ai) > 0. 

Proof. We proceed by main induction on k. When fc = 1, the claim is trivial: we must 
have fJb{Ai) > for all /, since otherwise we could take Bj = 0; then /u(p| Ai) = 
Y{lJi{Aj) > 0. So we assume that A; > 1 and that whenever Bj G Bn,i{M) and 
H"'{Ai \ Bi) < 6 for all /, H/ep."!) non-empty. Throughout this proof, the 

variable I ranges over ('^^"') . 

Claim 1. For any Iq, 

( ixA,, - E{XA,, I S;<7„(M))) n XA,dii^ = 0. 

Proof. When k = n, this is trivial since H/^^/o XAj is an empty product, and therefore 
equal to 1. 
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If A; < n, we have 



/ ixA,,-EixA,, I S::,</„(M))) n XA.df,^ 
= I {XA,^-E{XA,^ I B^,<,,(M))) n XA,d^\{xi}ieIo)d^i''-H{xiUIo)■ 



Observe that for any choice of {aJj^/Q, Bn,<io {M), Bn,<io (A/"U{aj}i0/J, B„,/o (M) 
satisfy the preceding lemma, so 

mXAr, I Bl^^j^{M)) - EixAr, I B^,<,„(M U {aJ,^,J)|U. = 0. 

The function Ylj^ XA, {{xi}tei„ , {ai}i^io ) is measurable with respect to S^_<7g (MU 
{ai}). Combining these two facts, we have 



= / (XA,, - i;(XA,, I B^,<,,(Mu{aOi^/o))) n XA.d/({a;0) 
=0 

Since this holds for any {aiji^i^, the claim follows by integrating over all choices of 
{ai}. H 

Claim 2. For any Iq, there is an A'j^ e (M) such that: 

• Whenever Bj e Bn,i{M) for each I, ij,^^{Aj \ Bj) < 6 for each I ^ Iq, and 
fi'^{A'j^ \ Bjg) < 5, P|^g^[i,™]^ Bj is non-empty, and 

. n nz^7„ Ai) > 0, fi^if] Aj) > 0. 

Proof Define A',^ := {x^ \ E{xas, I B-^^j^{M)){xi) > 0}. If M"(A',^nn,^,„ Aj) > 
then we have 



/ E{XA,, I <,,(M)) n XA,rf/x" > 



and by the previous claim, this imphes that yu(pl Aj) > 0. 

Suppose that for each I, Bj e Bnj{M) with /i"(A7 \ Bj) < 6 for 7 7^ 7o and 
/^"(A'j^ \ S/) < 5. Since 



\ A',J = y XA.,(1 -XA^JrfM" = y i5(XA., I Bl^Jil-XA'Jdn- = 0, 
we have /i"(A7g \ S/g ) < 5 as well, and therefore f] Bj is non-empty. H 
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By applying the previous claim to each I G we may assume for the rest of 

the proof that for each /, Ai G 6^ <^(M). 

Fix some finite algebra B C Bn,k-i{M) so that for every /, ||xai — E{xai \ 
B) 1 1^,2 < — - (such a B exists because there are finitely many / and each Ai is 

Bl fc_i(M)-measurable). For each I, set A*j = {Hi \ E{xa, | B){a) > jiv^}- 

Claim 3. For each I, \A}) <6/2 

Proof. Aj \ Aj is the set of points such 
Chebyshev's inequality, the measure of this set is at most 



Proof. Aj \ A J is the set of points such that XAi — E{xai | B){a) > By 



:(^^+lf j{XM-E{xA, I B)fdn = {(^^+inxM-E{xA, I B)\\l. < ^. 



Claim 4. ^^{[\, Ai) > ^(0, A})/ ((^) + l). 
Proof. For each Jo, 



P^{{Al\A,,)^^ f]A*)= f xa',^{1-XaJ n XAjd/x" 

= JxA^,^{l-E{xA,JBj}l[xA',dn" 

But then 



,j,{f]A*\f]Aj) <J2^i{Al\Ar,)n n A*)<-^^(f|AJ). 



Each A*j may be written in the form Ui<r/ where A*j ^ = f]j^^ i ^ A*j ^ j and 

A*j ,^ J is an element of Bn,j{M). We may assume that if i^^i' then A} ^ n A} ^, = 0. 
We have 

Mn^j)=M u n n a*.,^,/)- 

For each i e n/[l>''J^]' .Dj = n/Dje^ ' ) ^^^^ ^*i,ii,J ^ element 

of Bn,j{M), so we may group the components and write 2?^- = Hje^ " ) ^? j where 
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Suppose, for a contradiction, that //(H/ ^/) = 0. Then for every i G II/Il' ^^]' 
/z(-Dj) = /x(plj-Djj) = 0. By the inductive hypothesis, for each 7 > 0, there is 
a collection e' B„,j(M) such that )u(^?,7 \ j) < 7 and flj-Bi j = 0- In 
particular, this holds with 7 = ,(^^^)(n, ',)(.nax, - ' 

For each I,i < n, J C I, define 



Claims. t^{Al.j\B*)<„ 



Proof. Observe that if a; e A*j ^ j\B*j ^ j then for some i with ij = i, x ^ B:^ j U 
U/'Dj,/'^j^/',V,j- This means a; ^ anda; e fl/o = -Dj j. So 

6 



»6n/[i,'-/] 



2(fc!i)(max/^/)' 



Define i?;=U<.,n,7i?|,i,j. 
Claim 6. ^(A/ < 6. 

Proof. Since /^(A/ \ AJ) < d/2, it suffices to show that iJ,{A*j \ B}) < 6/2. 

f,{A} \ y fl Bi,^j) = M ( u n ^h,j \ u n 



i J 



i J 



i<ri \ J J ) 

i<.ri J 

( k \ 6 

< ri 



^-V 2(^^J(maxjr/) 



< S/2 



Claim 7. 



7 J 
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Proof. Suppose a; € H/ ^/ = H/ Ui<rj Hj^/.i,,/- Then for each /, there is an 
ii < rj such that x G C]j B'} j. Since B} j C A} j, for each / and J C I, 

For any J, let I D J. Then 



In particular, x G -Sj j U U/o j I'jtii^r ,j) the particular i we have chosen. 
Since x € A} ^ ^ jfor each /' D J, it must be that x £ Bj j. This holds for any J, so 

From our assumption, Hj ^/ is non-empty, and therefore there is some i such that 
PljiJjj. But this leads to a contradiction, so it must be that ij{f]jA*j) > 0, and 
therefore, as we have shown, fJ,{f]j Aj) > A*j) > 0. □ 
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